TCMGeneDIT: a database for associated traditional Chinese medicine, gene and disease information using text mining
نویسندگان
چکیده
BACKGROUND Traditional Chinese Medicine (TCM), a complementary and alternative medical system in Western countries, has been used to treat various diseases over thousands of years in East Asian countries. In recent years, many herbal medicines were found to exhibit a variety of effects through regulating a wide range of gene expressions or protein activities. As available TCM data continue to accumulate rapidly, an urgent need for exploring these resources systematically is imperative, so as to effectively utilize the large volume of literature. METHODS TCM, gene, disease, biological pathway and protein-protein interaction information were collected from public databases. For association discovery, the TCM names, gene names, disease names, TCM ingredients and effects were used to annotate the literature corpus obtained from PubMed. The concept to mine entity associations was based on hypothesis testing and collocation analysis. The annotated corpus was processed with natural language processing tools and rule-based approaches were applied to the sentences for extracting the relations between TCM effectors and effects. RESULTS We developed a database, TCMGeneDIT, to provide association information about TCMs, genes, diseases, TCM effects and TCM ingredients mined from vast amount of biomedical literature. Integrated protein-protein interaction and biological pathways information are also available for exploring the regulations of genes associated with TCM curative effects. In addition, the transitive relationships among genes, TCMs and diseases could be inferred through the shared intermediates. Furthermore, TCMGeneDIT is useful in understanding the possible therapeutic mechanisms of TCMs via gene regulations and deducing synergistic or antagonistic contributions of the prescription components to the overall therapeutic effects. The database is now available at http://tcm.lifescience.ntu.edu.tw/. CONCLUSION TCMGeneDIT is a unique database that offers diverse association information on TCMs. This database integrates TCMs with biomedical studies that would facilitate clinical research and elucidate the possible therapeutic mechanisms of TCMs and gene regulations.
منابع مشابه
Text mining for traditional Chinese medical knowledge discovery: A survey
Extracting meaningful information and knowledge from free text is the subject of considerable research interest in the machine learning and data mining fields. Text data mining (or text mining) has become one of the most active research sub-fields in data mining. Significant developments in the area of biomedical text mining during the past years have demonstrated its great promise for supporti...
متن کاملTCMID: traditional Chinese medicine integrative database for herb molecular mechanism analysis
As an alternative to modern western medicine, Traditional Chinese Medicine (TCM) is receiving increasingly attention worldwide. Great efforts have been paid to TCM's modernization, which tries to bridge the gap between TCM and modern western medicine. As TCM and modern western medicine share a common aspect at molecular level that the compound(s) perturb human's dysfunction network and restore ...
متن کاملPublishing Chinese medicine knowledge as Linked Data on the Web
BACKGROUND Chinese medicine (CM) draws growing attention from Western healthcare practitioners and patients. However, the integration of CM knowledge and Western medicine (WM) has been hindered by a barrier of languages and cultures as well as a lack of scientific evidence for CM's efficacy and safety. In addition, most of CM knowledge published with relational database technology makes the int...
متن کاملCommon herbal treatments for senile dementia in ancient civilizations: Greco-Roman, Chinese, Indian, and Iranian
Background: Senile dementia is the most common kind of dementia with considerable social and economic costs. Since the nature of disease is multi-pathological, current treatments cannot cover all aspects of the disease. Recently, scientific considerations have focused on the role of natural products, especially those with traditional backgrounds. Objective: to review natural treatments of demen...
متن کاملGene regulation network fitting of genes involved in the pathophysiology of fatty liver in the mice by promoter mining
Background and Aim: Non-Alcoholic Fatty Liver Disease (NAFLD) is the major cause of chronic liver disease in developed countries. In this study, we identified the most important transcription factors and biological mechanisms affecting the incidence of fatty liver disease using the promoter region data mining. Materials and Methods In this study, at first, the marker genes associated with this...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- BMC Complementary and Alternative Medicine
دوره 8 شماره
صفحات -
تاریخ انتشار 2008